Error-corrective discriminative joint decoding of automatic spoken language transcription and understanding
Authors
Abstract
Following recent trends in the development of spoken dialogue systems, this paper proposes to improve the extraction of the user's intent by jointly decoding automatic spoken language transcription and understanding. Gains are expected not only from better connectivity and mutual awareness between the two tasks but also from the use of discriminative models and the integration of an intermediate error-corrective mechanism. The latter is based on statistical post-editing of the speech recognizer word lattice, while conditional random fields instantiate the former in our system. An overall absolute reduction of 1.1% is observed by direct application of the proposed techniques on the MEDIA task.
Similar resources
Corrective Tuning by Applying LVQ for Continuous Density and Semi-continuous Markov Models
In this work the objective is to increase the accuracy of speaker-dependent phonetic transcription of spoken utterances using continuous density and semi-continuous HMMs. Experiments with LVQ-based corrective tuning indicate that the average recognition error rate can be reduced by about 5% to 10%. Experiments are also made to increase the efficiency of the Viterbi decoding by a discriminati...
An error-corrective language-model adaptation for automatic speech recognition
We present a new language model adaptation framework integrated with an error handling method to improve accuracy of speech recognition and performance of spoken language applications. The proposed error corrective language model adaptation approach exploits domain-specific language variations and recognition environment characteristics to provide robustness and adaptability for a spoken language ...
Concept segmentation and labeling for conversational speech
Spoken Language Understanding performs automatic concept labeling and segmentation of speech utterances. For this task, many approaches have been proposed based on both generative and discriminative models. While all these methods have shown remarkable accuracy on manual transcription of spoken utterances, robustness to noisy automatic transcription is still an open issue. In this paper we stud...
Optimization on decoding graphs by discriminative training
The three main knowledge sources used in automatic speech recognition (ASR), namely the acoustic models, a dictionary and a language model, are usually designed and optimized in isolation. Our previous work [1] proposed a methodology for jointly tuning these parameters, based on the integration of the resources as a finite-state graph, whose transition weights are trained discriminatively. ...
Experiments on Error-corrective Language Model Adaptation
We present a new language model adaptation framework integrated with an error handling method to improve accuracy of speech recognition and performance of spoken language applications. The proposed error corrective language model (ECLM) adaptation approach exploits recognition environment characteristics and domain-specific semantic information to provide robustness and adaptability for a spoke...
Publication date: 2013